Uyghur Short Text Classification Using Morphological Information
نویسندگان
چکیده
منابع مشابه
Uyghur Short Text Classification Using Morphological Information
In this paper, we propose a novel method for improving the classification performance of short text strings using conditional random fields (CRFs) that combine morphological information. Experimental results on three datasets (Uyghur, Chinese, and English) demonstrate that our method can yield higher classification accuracy than Support Vector Machine (SVM) classifier and Maximum Entropy Model ...
متن کاملNoisy Uyghur Text Normalization
Uyghur is the second largest and most actively used social media language in China. However, a non-negligible part of Uyghur text appearing in social media is unsystematically written with the Latin alphabet, and it continues to increase in size. Uyghur text in this format is incomprehensible and ambiguous even to native Uyghur speakers. In addition, Uyghur texts in this form lack the potential...
متن کاملText Classification in Information Retrieval using Winnow
Text classification in Information Retrieval can be done by using a linear classifier. Linear learning algorithms classify documents by learning a linear separator based on the document features. Littlestone's Winnow is such as linear learning algorithm. I have described three learning algorithms, based on Littlestone's Winnow, which can be applied to perform this task. Modifications of the alg...
متن کاملTopic Modeling and Classification of Cyberspace Papers Using Text Mining
The global cyberspace networks provide individuals with platforms to can interact, exchange ideas, share information, provide social support, conduct business, create artistic media, play games, engage in political discussions, and many more. The term cyberspace has become a conventional means to describe anything associated with the Internet and the diverse Internet culture. In fact, cyberspac...
متن کاملFeature Selection for Short Text Classification using Wavelet Packet Transform
Text classification tasks suffer from curse of dimensionality due to large feature space. Short text data further exacerbates the problem due to their sparse and noisy nature. Feature selection thus becomes an important step in improving the classification performance. In this paper, we propose a novel feature selection method using Wavelet Packet Transform. Wavelet Packet Transform (WPT) has b...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Research in Computing Science
سال: 2015
ISSN: 1870-4069
DOI: 10.13053/rcs-90-1-26